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IN THE CLAIMS 
Please amend the claims as follows: 

Claim 1 (Currently Amended): A document classification system for classifying [[a]] 
docum e nt documents based on contents of the document documents of which contents 
contains a plurality of items, said document classification system comprising: 

inputting means for inputting a group of document data , each document data 
corresponding to the docum e nt documents ; 

displaying means for displaying the plurality of items of the document data input by 
said inputting means , wherein the items are shared by the group of document data ; 

designating means for designating a portion of the plurality of the items displayed in 
said displaying means; 

converting means for converting [[the]] each document data into converted data so 
that the converted data only contains only data corresponding to the items designated by said 
designating means which arc a portion l e ss than all of the it e ms of th e document data ; and 

classifying means for classifying the document by using the converted data produced 
by said converting means. 

Claim 2 (Original): The document classification system as claimed in 1, wherein said 
classifying means includes document vector producing means for producing a feature vector 
representing a feature of the converted data so as to classify the document in accordance with 
the feature vector produced by said document vector producing means. 

Claim 3 (Original): The document classification system as claimed in 1, wherein said 
converting means includes separation sign inserting means for inserting a predetermined sign 
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between sets of data corresponding to the items so as to facilitate separation of each data 
corresponding to each item in the converted data. 

Claim 4-6 (Canceled). 

Claim 7 (Currently Amended): A processor readable medium storing program code 
causing a computer to classify a docum e nt documents based on contents of the docum e nt 
documents of which contents contains a plurality of items, comprising: 

first program code means for inputting a group of document dat a, each document data 
corresponding to the docum e nt documents ; 

second program code means for displaying the plurality of items of the document data 
input by said first program code means , wherein the items are shared by the group of 
document data : 

third program code means for designating a portion of the plurality of the items 
displayed by the second program code means; 

fourth program code means for converting [[the]] each document data into converted 
data so that the converted data only contains only data corresponding to the items designated 
by the third program code means which ar e a portion l e ss than all of th e it e ms of th e 
docum e nt data ; and 

fifth program code means for classifying the document by using the converted data 
produced by the fourth program code means. 

Claim 8 (Previously Presented): The processor readable medium as claimed in 7, 
wherein the fifth program code means includes sixth program code means for producing a 
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feature vector representing a feature of the converted data so as to classify the document in 
accordance with the feature vector. 

Claim 9 (Previously Presented): The processor readable medium as claimed in 7, 
wherein the fourth program code means includes seventh program code means for inserting a 
predetermined sign between sets of data corresponding to the items so as to facilitate 
separation of each data corresponding to each item in the converted data. 

Claim 10 (Previously Presented): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected with a dimensional number different from a 
dimensional number of the document feature vector, the transforming function calculating 
means calculating the representation transforming function by using an inner product 
calculated between the document feature vectors; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; and 
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classification result storing means for storing a result of classification performed by 
the classification means. 

Claim 1 1 (Original): The document classification system as claimed in 10, further 
comprising inner product calculating means for calculating an inner product between the 
document feature vectors, wherein said representation transforming function calculating 
means calculates the representation transforming function by using the inner product. 

Claim 12 (Previously Presented): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; 

classification result storing means for storing a result of classification performed by 
the classification means; 
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inner product calculating means for calculating an inner product between the 
document feature vectors, wherein said representation transforming function calculating 
means calculates the representation transforming function by using the inner product; and 

document similarity information setting means for setting document similarity setting 
information including data representing an author of the document and a date of production 
of the document, wherein said representation transforming function calculating means 
calculates the representation transforming function by using the inner product and the 
document similarity information. 

Claim 13 (Original): The document classification system as claimed in 10, further 
comprising: 

vector storing means for storing the document feature vector produced by said vector 
producing means; and 

transforming function storing means for storing the representation transforming 
function calculated by said representation transforming function calculating means. 

Claim 14 (Previously Presented): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 
vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 
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transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; 

classification result storing means for storing a result of classification performed by 
the classification means; and 

vector correcting means for correcting the document feature vector before the 
document feature vector is transformed by said vector transforming means, a correction being 
performed by processing one of the document feature vector and a feature dimension 
constituting the document feature vector in accordance with a rule established by 
characteristics of words extracted by said analyzing means. 

Claim 15 (Original): The document classification system as claimed in 14, further 
comprising transforming function correcting means for correcting the representation 
transforming function calculated by said transforming function calculating means when the 
feature dimension is changed due to a correction of the document feature vector by said 
vector correcting means so that the document feature vector is transformed by said vector 
transforming means in accordance with the changed feature dimension. 

Claim 16 (Previously Presented): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 
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input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; 

classification-result storing means for storing a result of classification performed by 
the classification means; 

transforming function correction instructing means for sending an instruction 
regarding a process to be applied on a feature dimension of the representation transforming 
function; and 

transforming function correcting means for correcting the representation transforming 
function based on a content of the instruction sent from said transforming function correction 
instructing means. 

Claim 17 (Original): The document classification system as claimed in 16, wherein 
the process indicated in the content of the instruction is performed by using data of an 
arbitrary document vector. 
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Claim 18 (Original): The document classification system as claimed in 16, wherein 
the process indicated in the content of the instruction is performed by using the document 
feature vectors. 

Claim 19 (Original): The document classification system as claimed in 16, wherein 
the process indicated in the content of the instruction is performed by using the analysis 
information obtained by said analyzing means. 

Claim 20 (Original): The document classification system as claimed in 16, wherein 
the process indicated in the content of the instruction is performed by using the result of 
classification stored in said classification-result storing means. 

Claim 21 (Previously Presented): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 
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classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; 

classification result storing means for storing a result of classification performed by 
the classification means; 

an initial cluster centroid designating means for designating an initial cluster centroid; 

and 

initial cluster centroid registering means for registering the initial cluster centroid 
designated by said initial cluster centroid designating means, 

wherein said classification means classifies the document in accordance with the 
initial cluster centroid registered by said initial cluster centroid registering means. 

Claim 22 (Original): The document classification system as claimed in 21, wherein 
the initial cluster centroid designated by said initial cluster centroid designating means is 
arbitrary document vector data. 

Claim 23 (Original): The document classification system as claimed in 21, wherein 
the initial cluster centroid designated by said initial cluster centroid designating means is the 
document feature vector. 

Claim 24 (Original): The document classification system as claimed in 21, wherein 
the initial cluster centroid designated by said initial cluster centroid designating means is the 
analysis information obtained by said analyzing means. 
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Claim 25 (Original): The document classification system as claimed in 21, wherein 
the initial cluster centroid designated by said initial cluster centroid designating means is the 
result of classification stored by said classification-result storing means. 

Claims 26-41 (Canceled). 

Claim 42 (Previously Presented): A processor readable medium storing program code 
causing a computer to classify a document according to contents of the document, 
comprising: 

first program code means for inputting document data of the document; 
second program code means for analyzing the document data so as to obtain analysis 
information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected with a dimensional number different from a 
dimensional number of the document feature vector, the fourth program code means 
calculating the representation transforming function by using an inner product calculated 
between the document feature vectors; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; and 
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seventh program code means for storing a result of classification performed by the 
classification means. 

Claim 43 (Original): The processor readable medium as claimed in 42, further 
comprising eighth program code means for calculating an inner product between the 
document feature vectors, wherein the representation transforming function is calculated by 
using the inner product. 

Claim 44 (Previously Presented): A processor readable medium storing program code 
causing a computer to classify a document according to contents of the document, 
comprising: 

first program code means for inputting document data of the document; 
second program code means for analyzing the document data so as to obtain analysis 
information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; 
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eighth program code means for calculating an inner product between the document 
feature vectors, wherein the representation transforming function is calculated by using the 
inner product; and 

ninth program code means for setting document similarity setting information 
including data representing an author of the document and a date of production of the 
document, wherein the representation transforming function is calculated by using the inner 
product and the document similarity information. 

Claim 45 (Original): The processor readable medium as claimed in 42, further 
comprising: 

tenth program code means for storing the document feature vector produced by the 
third program code means; and 

eleventh program code means for storing the representation transforming function 
calculated by the fourth program code means. 

Claim 46 (Previously Presented): A processor readable medium storing program code 
causing a computer to classify a document according to contents of the document, 
comprising: 

first program code means for inputting document data of the document; 
second program code means for analyzing the document data so as to obtain analysis 
information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 
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fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; and 

eighth program code means for correcting the document feature vector before the 
document feature vector is transformed by the fifth program code means, a correction being 
performed by processing one of the document feature vector and a feature dimension 
constituting the document feature vector in accordance with a rule established by 
characteristics of words extracted by the second program code means. 

Claim 47 (Previously Presented): The processor readable medium as claimed in 46, 
further comprising ninth program code means for correcting the representation transforming 
function calculated by the fourth program code means when the feature dimension is changed 
due to a correction of the document feature vector by the eighth program code means so that 
the document feature vector is transformed by the fifth program code means in accordance 
with the changed feature dimension. 

Claim 48 (Previously Presented): A processor readable medium storing program code 
causing a computer to classify a document according to contents of the document, 
comprising: 
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first program code means for inputting document data of the document; 
second program code means for analyzing the document data so as to obtain analysis 
information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; 

eighth program code means for sending an instruction regarding a process to be 
applied on a feature dimension of the representation transforming function; and 

ninth program code means for correcting the representation transforming function 
based on a content of the instruction sent by the eighth program code means. 

Claim 49 (Previously Presented): A processor readable medium storing program code 
causing a computer to classify a document according to contents of the document, 
comprising: 

first program code means for inputting document data of the document; 
second program code means for analyzing the document data so as to obtain analysis 
information; 
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third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; 

eighth program code means for designating an initial cluster centroid; and 

ninth program code means for registering the initial cluster centroid designated by the 
eighth program code means, 

wherein the document is classified in accordance with the initial cluster centroid 
registered by the ninth program code means. 

Claim 50 (Canceled). 

Claim 51 (New): The document classification system as claimed in claim 1 5 wherein 
the designated portion is less than all of the plurality of items of the document data. 

Claim 52 (New): The document classification system as claimed in claim 7, wherein 
the designated portion is less than all of the plurality of items of the document data. 
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